TST/BTD: An End-to-End Visual Recognition System

نویسندگان

  • Taehee Lee
  • Stefano Soatto
چکیده

We describe a visual recognition system operating on a hand-held device. Feature selection and tracking are performed in real-time, and used to train a template-based classifier during a capture phase prompted by the user. During normal operation, the system scores objects in the field of view based on their ranking. Severe resource constraints have prompted a re-evaluation of existing algorithms improving their performance (accuracy and robustness) as well as computational efficiency. We motivate the design choices in the implementation with a characterization of the stability properties of local invariant detectors, and of the conditions under which a template-based descriptor is optimal. The analysis also highlights the role of time as “weak supervisor” during training, which we exploit in our implementation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simulation of Position Based Visual Control and Performance Tests of 6R Robot

This paper presents simulation and experimental results of position-based visual servoing control process of a 6R robot using 2 fixed cameras. This method has the ability to deal with real time changes in the relative position of the target-object with respect to robot. Also, greater accuracy and independency of servo control structure from the target pose coordinates are the additional advanta...

متن کامل

Combining Residual Networks with LSTMs for Lipreading

We propose an end-to-end deep learning architecture for wordlevel visual speech recognition. The system is a combination of spatiotemporal convolution, residual and bidirectional LongShort Term Memory networks. We trained and evaluated it on the Lipreading In-The-Wild benchmark, a challenging database of 500-size vocabulary consisting of video excerpts from BBC TV broadcasts. The proposed netwo...

متن کامل

Improving Speaker-Independent Lipreading with Domain-Adversarial Training

We present a Lipreading system, i.e. a speech recognition system using only visual features, which uses domain-adversarial training for speaker independence. Domain-adversarial training is integrated into the optimization of a lipreader based on a stack of feedforward and LSTM (Long Short-Term Memory) recurrent neural networks, yielding an end-to-end trainable system which only requires a very ...

متن کامل

اثربرنامۀ توانبخشی شناختی پریا بر بهبود توانایی بازشناسی حالات هیجانی در کودکان مبتلا به اختلال اتیسم با عملکرد بالا

Background & Aims: Autism spectrum disorder is neurodevelopmental disorder characterized by social-communication difficulties and stereotyped behaviors. The present study evaluates the effects of cognitive rehabilitation based on inverse imitation on recognition of basic emotion in children with high functioning autism disorder. Materials Method: The method was quasi-experimental and single-...

متن کامل

Fault Detection and Isolation of Vehicle Driveline System

Vehicle driveline system and its working accuracy play an important role in the performance of car. The purpose of this study is to provide an appropriate mechanism for investigating, identifying and determining the position and size of defects in the vehicle power transmission system. This is based on the patterns of the residual signal, obtained from a simulated model of the system. Neuro-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010